Search CORE

177 research outputs found

Recommended from our members

Global isoform-specific transcript alterations and deregulated networks in clear cell renal cell carcinoma.

Author: Girke Thomas
Hamilton Michael J
Martinez Ernest
Publication venue: eScholarship, University of California
Publication date: 01/05/2018
Field of study

Extensive genome-wide analyses of deregulated gene expression have now been performed for many types of cancer. However, most studies have focused on deregulation at the gene-level, which may overlook the alterations of specific transcripts for a given gene. Clear cell renal cell carcinoma (ccRCC) is one of the best-characterized and most pervasive renal cancers, and ccRCCs are well-documented to have aberrant RNA processing. In the present study, we examine the extent of aberrant isoform-specific RNA expression by reporting a comprehensive transcript-level analysis, using the new kallisto-sleuth-RATs pipeline, investigating coding and non-coding differential transcript expression in ccRCC. We analyzed 50 ccRCC tumors and their matched normal samples from The Cancer Genome Altas datasets. We identified 7,339 differentially expressed transcripts and 94 genes exhibiting differential transcript isoform usage in ccRCC. Additionally, transcript-level coexpression network analyses identified vasculature development and the tricarboxylic acid cycle as the most significantly deregulated networks correlating with ccRCC progression. These analyses uncovered several uncharacterized transcripts, including lncRNAs FGD5-AS1 and AL035661.1, as potential regulators of the tricarboxylic acid cycle associated with ccRCC progression. As ccRCC still presents treatment challenges, our results provide a new resource of potential therapeutics targets and highlight the importance of exploring alternative methodologies in transcriptome-wide studies

eScholarship - University of California

SEED: efficient clustering of next-generation sequences.

Author: Bao Ergude
Girke Thomas
Jiang Tao
Kaloshian Isgouhi
Publication venue: eScholarship, University of California
Publication date: 02/08/2011
Field of study

MotivationSimilarity clustering of next-generation sequences (NGS) is an important computational problem to study the population sizes of DNA/RNA molecules and to reduce the redundancies in NGS data. Currently, most sequence clustering algorithms are limited by their speed and scalability, and thus cannot handle data with tens of millions of reads.ResultsHere, we introduce SEED-an efficient algorithm for clustering very large NGS sets. It joins sequences into clusters that can differ by up to three mismatches and three overhanging residues from their virtual center. It is based on a modified spaced seed method, called block spaced seeds. Its clustering component operates on the hash tables by first identifying virtual center sequences and then finding all their neighboring sequences that meet the similarity parameters. SEED can cluster 100 million short read sequences in <4 h with a linear time and memory performance. When using SEED as a preprocessing tool on genome/transcriptome assembly data, it was able to reduce the time and memory requirements of the Velvet/Oasis assembler for the datasets used in this study by 60-85% and 21-41%, respectively. In addition, the assemblies contained longer contigs than non-preprocessed data as indicated by 12-27% larger N50 values. Compared with other clustering tools, SEED showed the best performance in generating clusters of NGS data similar to true cluster results with a 2- to 10-fold better time performance. While most of SEED's utilities fall into the preprocessing area of NGS data, our tests also demonstrate its efficiency as stand-alone tool for discovering clusters of small RNA sequences in NGS data from unsequenced organisms.AvailabilityThe SEED software can be downloaded for free from this site: http://manuals.bioinformatics.ucr.edu/home/[email protected] informationSupplementary data are available at Bioinformatics online

PubMed Central

eScholarship - University of California

Identification and characterization of endogenous small interfering RNAs from rice

Author: Girke Thomas
Sunkar Ramanjulu
Zhu Jian-Kang
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

RNA silencing-mediated small interfering RNAs (siRNAs) and microRNAs (miRNAs) have diverse natural roles, ranging from regulation of gene expression and heterochromatin formation to genome defense against transposons and viruses. Unlike miRNAs, endogenous siRNAs are generally not conserved between species; consequently, their identification requires experimental approaches. Thus far, endogenous siRNAs have not been reported from rice, which is a model species for monocotyledonous plants. We identified a large set of putative endogenous siRNAs from root, shoot and inflorescence small RNA cDNA libraries of rice. Most of these siRNAs are from intergenic regions, although a substantial proportion (22%) originates from the introns and exons of protein-coding genes. Northern and RT–PCR analysis revealed that the expression of some of the siRNAs is tissue specific or developmental stage specific. A total of 25 transposons and 21 protein-coding genes were predicted to be cis-targets of some of the siRNAs. Based on sequence homology, we also predicted 111 putative trans-targets for 44 of the siRNAs. Interestingly, ∼46% of the predicted trans-targets are transposable elements, which suggests that endogenous siRNAs may play an important role in the suppression of transposon proliferation. Using RNA ligase-mediated-5′ rapid amplification of cDNA end assays, we validated three of the predicted targets and provided evidence for both cis- and trans-silencing of target genes by siRNAs-guided mRNA cleavage

CiteSeerX

Crossref

PubMed Central

Predicting conserved protein motifs with Sub-HMMs

Author: Girke Thomas
Horan Kevin
Shelton Christian R
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

BackgroundProfile HMMs (hidden Markov models) provide effective methods for modeling the conserved regions of protein families. A limitation of the resulting domain models is the difficulty to pinpoint their much shorter functional sub-features, such as catalytically relevant sequence motifs in enzymes or ligand binding signatures of receptor proteins.ResultsTo identify these conserved motifs efficiently, we propose a method for extracting the most information-rich regions in protein families from their profile HMMs. The method was used here to predict a comprehensive set of sub-HMMs from the Pfam domain database. Cross-validations with the PROSITE and CSA databases confirmed the efficiency of the method in predicting most of the known functionally relevant motifs and residues. At the same time, 46,768 novel conserved regions could be predicted. The data set also allowed us to link at least 461 Pfam domains of known and unknown function by their common sub-HMMs. Finally, the sub-HMM method showed very promising results as an alternative search method for identifying proteins that share only short sequence similarities.ConclusionsSub-HMMs extend the application spectrum of profile HMMs to motif discovery. Their most interesting utility is the identification of the functionally relevant residues in proteins of known and unknown function. Additionally, sub-HMMs can be used for highly localized sequence similarity searches that focus on shorter conserved features rather than entire domains or global similarities. The motif data generated by this study is a valuable knowledge resource for characterizing protein functions in the future

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Recommended from our members

Experimental Acute Exposure to Thirdhand Smoke and Changes in the Human Nasal Epithelial Transcriptome: A Randomized Clinical Trial.

Author: Girke Thomas
Kagda Meenakshi S
Pozuelos Giovanna L
Schick Suzaynn
Talbot Prue
Volz David C
Publication venue: eScholarship, University of California
Publication date: 01/06/2019
Field of study

Importance:No previous studies have shown that acute inhalation of thirdhand smoke (THS) activates stress and survival pathways in the human nasal epithelium. Objective:To evaluate gene expression in the nasal epithelium of nonsmoking women following acute inhalation of clean air and THS. Design, Setting, and Participants:Nasal epithelium samples were obtained from participants in a randomized clinical trial (2011-2015) on the health effects of inhaled THS. In a crossover design, participants were exposed, head only, to THS and to conditioned, filtered air in a laboratory setting. The order of exposures was randomized and exposures were separated by at least 21 days. Ribonucleic acid was obtained from a subset of 4 healthy, nonsmoking women. Exposures:By chance, women in the subset were randomized to receive clean air exposure first and THS exposure second. Exposures lasted 3 hours. Main Outcomes and Measures:Differentially expressed genes were identified using RNA sequencing with a false-discovery rate less than 0.1. Results:Participants were 4 healthy, nonsmoking women aged 27 to 49 years (mean [SD] age, 42 [10.2] years) with no chronic diseases. A total of 389 differentially expressed genes were identified in nasal epithelium exposed to THS, while only 2 genes, which were not studied further, were affected by clean air. Enriched gene ontology terms associated with stress-induced mitochondrial hyperfusion were identified, such as respiratory electron transport chain (q = 2.84 × 10-3) and mitochondrial inner membrane (q = 7.21 × 10-6). Reactome pathway analysis identified terms associated with upregulation of DNA repair mechanisms, such as nucleotide excision repair (q = 1.05 × 10-2). Enrichment analyses using ingenuity pathway analysis identified canonical pathways related to stress-induced mitochondrial hyperfusion (eg, increased oxidative phosphorylation) (P = .001), oxidative stress (eg, glutathione depletion phase II reactions) (P = .04), and cell survival (z score = 5.026). Conclusions and Relevance:This study found that acute inhalation of THS caused cell stress that led to the activation of survival pathways. Some responses were consistent with stress-induced mitochondrial hyperfusion and similar to those demonstrated previously in vitro. These data may be valuable to physicians treating patients exposed to THS and may aid in formulating regulations for the remediation of THS-contaminated environments

eScholarship - University of California

What makes species unique? The contribution of proteins with obscure features

Author: Bailey-Serres Julia
Cushman John
Girke Thomas
Gollery Martin
Harper Jeff
Mittler Ron
Mittler Taliah
Zhu Jian-Kang
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Proteins with obscure features (POFs), which lack currently defined motifs or domains, represent between 18% and 38% of a typical eukaryotic proteome. To evaluate the contribution of this class of proteins to the diversity of eukaryotes, we performed a comparative analysis of the predicted proteomes derived from 10 different sequenced genomes, including budding and fission yeast, worm, fly, mosquito, Arabidopsis, rice, mouse, rat, and human. RESULTS: Only 1,650 protein groups were found to be conserved among these proteomes (BLAST E-value threshold of 10(-6)). Of these, only three were designated as POFs. Surprisingly, we found that, on average, 60% of the POFs identified in these 10 proteomes (44,236 in total) were species specific. In contrast, only 7.5% of the proteins with defined features (PDFs) were species specific (17,554 in total). As a group, POFs appear similar to PDFs in their relative contribution to biological functions, as indicated by their expression, participation in protein-protein interactions and association with mutant phenotypes. However, POF have more predicted disordered structure than PDFs, implying that they may exhibit preferential involvement in species-specific regulatory and signaling networks. CONCLUSION: Because the majority of eukaryotic POFs are not well conserved, and by definition do not have defined domains or motifs upon which to formulate a functional working hypothesis, understanding their biochemical and biological functions will require species-specific investigations

Springer - Publisher Connector

PubMed Central

Deciphering the Ubiquitin-Mediated Pathway in Apicomplexan Parasites: A Potential Strategy to Interfere with Parasite Virulence

Author: Chung Duk-Won Doug
Girke Thomas
Horrocks Paul
Le Roch Karine G.
Ponts Nadia
Prudhomme Jacques
Yang Jianfeng
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Reversible modification of proteins through the attachment of ubiquitin or ubiquitin-like modifiers is an essential post-translational regulatory mechanism in eukaryotes. The conjugation of ubiquitin or ubiquitin-like proteins has been demonstrated to play roles in growth, adaptation and homeostasis in all eukaryotes, with perturbation of ubiquitin-mediated systems associated with the pathogenesis of many human diseases, including cancer and neurodegenerative disorders

Keele Research Repository

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

HAL Descartes

eScholarship - University of California

ProdInra

Profiling translatomes of discrete cell populations resolves altered cellular priorities during hypoxia in Arabidopsis

Author: Bailey-Serres Julia
Galbraith David W.
Girke Thomas
Holtan Hans E.
Jang Charles J.H.
Mustroph Angelika
Repetti Peter P.
Zanetti María Eugenia
Publication venue
Publication date: 04/10/2019
Field of study

Multicellular organs are composed of distinct cell types with unique assemblages of translated mRNAs. Here, ribosome-associated mRNAs were immunopurified from specific cell populations of intact seedlings using Arabidopsis thaliana lines expressing a FLAG-epitope tagged ribosomal protein L18 (FLAG-RPL18) via developmentally regulated promoters. The profiling of mRNAs in ribosome complexes, referred to as the translatome, identified differentially expressed mRNAs in 21 cell populations defined by cell-specific expression of FLAG-RPL18. Phloem companion cells of the root and shoot had the most distinctive translatomes. When seedlings were exposed to a brief period of hypoxia, a pronounced reprioritization of mRNA enrichment in the cell-specific translatomes occurred, including a ubiquitous rise in 49 mRNAs encoding transcription factors, signaling proteins, anaerobic metabolism enzymes, and uncharacterized proteins. Translatome profiling also exposed an intricate molecular signature of transcription factor (TF) family member mRNAs that was markedly reconfigured by hypoxia at global and cell-specific levels. In addition to the demonstration of the complexity and plasticity of cell-specific populations of ribosome-associated mRNAs, this study provides an in silico dataset for recognition of differentially expressed genes at the cell-, region-, and organ-specific levels.Instituto de Biotecnologia y Biologia Molecula